33 research outputs found

Quantifying the Impact of Non-Stationarity in Reinforcement Learning-Based Traffic Signal Control

Author: Alegre Lucas N.
Bazzan Ana L. C.
da Silva Bruno C.
Publication date: 09/04/2020

In reinforcement learning (RL), dealing with non-stationarity is a challenging issue. However, some domains such as traffic optimization are inherently non-stationary. Causes for and effects of this are manifold. In particular, when dealing with traffic signal controls, addressing non-stationarity is key since traffic conditions change over time and as a function of traffic control decisions taken in other parts of a network. In this paper we analyze the effects that different sources of non-stationarity have in a network of traffic signals, in which each signal is modeled as a learning agent. More precisely, we study both the effects of changing the context in which an agent learns (e.g., a change in flow rates experienced by it), as well as the effects of reducing agent observability of the true environment state. Partial observability may cause distinct states (in which distinct actions are optimal) to be seen as the same by the traffic signal agents. This, in turn, may lead to sub-optimal performance. We show that the lack of suitable sensors to provide a representative observation of the real state seems to affect the performance more drastically than the changes to the underlying traffic patterns. Comment: 13 page

arXiv.org e-Print Archive

ScholarWorks@UMass Amherst

Lume 5.8

PubMed Central

Sample-Efficient Multi-Objective Learning via Generalized Policy Improvement Prioritization

Author: Alegre Lucas N.
Bazzan Ana L. C.
da Silva Bruno C.
Nowé Ann
Roijers Diederik M.
Publication date: 23/03/2023

Multi-objective reinforcement learning (MORL) algorithms tackle sequential decision problems where agents may have different preferences over (possibly conflicting) reward functions. Such algorithms often learn a set of policies (each optimized for a particular agent preference) that can later be used to solve problems with novel preferences. We introduce a novel algorithm that uses Generalized Policy Improvement (GPI) to define principled, formally-derived prioritization schemes that improve sample-efficient learning. They implement active-learning strategies by which the agent can (i) identify the most promising preferences/objectives to train on at each moment, to more rapidly solve a given MORL problem; and (ii) identify which previous experiences are most relevant when learning a policy for a particular agent preference, via a novel Dyna-style MORL method. We prove our algorithm is guaranteed to always converge to an optimal solution in a finite number of steps, or an ε-optimal solution (for a bounded ε) if the agent is limited and can only identify possibly sub-optimal policies. We also prove that our method monotonically improves the quality of its partial solutions while learning. Finally, we introduce a bound that characterizes the maximum utility loss (with respect to the optimal solution) incurred by the partial solutions computed by our method throughout learning. We empirically show that our method outperforms state-of-the-art MORL algorithms in challenging multi-objective tasks, both with discrete and continuous state and action spaces. Comment: Accepted to AAMAS 202
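As a rough illustration of the Generalized Policy Improvement step this abstract builds on, the sketch below picks an action for a new preference vector from a set of already-learned policies. It is not the authors' prioritization algorithm: the function name `gpi_action`, the nested-list layout of `q_values`, and the linear scalarization by `w` are assumptions made for this example only.

```python
from typing import Sequence

def gpi_action(
    q_values: Sequence[Sequence[Sequence[float]]],
    w: Sequence[float],
) -> int:
    """Return the GPI action for preference weights w in a single state.

    q_values[p][a] is the (hypothetical) vector of per-objective Q-values
    of policy p for action a in the current state.
    """
    n_policies = len(q_values)
    n_actions = len(q_values[0])

    def scalarized(policy: int, action: int) -> float:
        # Linear scalarization: weighted sum of the per-objective values.
        return sum(wi * qi for wi, qi in zip(w, q_values[policy][action]))

    # For each action, keep the best scalarized value over all known policies.
    best_per_action = [
        max(scalarized(p, a) for p in range(n_policies))
        for a in range(n_actions)
    ]
    # Act greedily with respect to that upper envelope.
    return max(range(n_actions), key=best_per_action.__getitem__)


# Two policies, two actions, two objectives (made-up numbers).
example_q = [
    [[1.0, 0.0], [0.0, 0.2]],  # policy 0
    [[0.1, 0.1], [0.0, 1.0]],  # policy 1
]
```

With weights favouring the first objective, such as `[0.9, 0.1]`, the call selects action 0; with `[0.1, 0.9]` it selects action 1, because the maximum over policies is taken before the maximum over actions.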

arXiv.org e-Print Archive

Analysis of single nucleotide polymorphisms in the FAS and CTLA-4 genes of peripheral T-cell lymphomas

Author: A Attygalle
A Ligers
AH Sharpe
AI Bolstad
Alexander Marx
Andreas Rosenwald
AY Savinov
B Do
B Torres
Bernhard Puppe
BM Carreno
BS Shastry
C Beltinger
C Giordano
CE Jackson
CG Vinuesa
CM Rudin
DR Green
DT Teachey
E Geissinger
EL Masteller
ES Jaffe
Eva Geissinger
F Rieux-Laucat
F Sallusto
G Cutrona
GH Fisher
H Ueda
Hans Konrad Müller-Hermelink
HC Lai
HF Harbo
I Behrmann
I Maric
Irina Bonzheim
J Braun
J Dupuis
JC Barrett
JE Niemela
K Geleijns
K Gronbaek
KL Grogg
L Lander EKruglyak
L Mullauer
L Nistico
L Pensati
LL Hudson
M Cutrona GFerrarini
M Lucas
M Monne
Martin Wilhelm
MC Puck JMSneller
MC Sneller
MJ Lenardo
ML Alegre
MS Shin
N Erfani
OH Kantarci
P Depraetere VGolstein
P Strobel
Peter Reimer
Philipp Ströbel
R Lai
RL Nolsoe
S Anjos
S Jayaraman
S Kanemitsu
Sabine Roth
SC Gough
SE Straus
T Horiuchi
T Kouki
T Oliveira JBFleisher
TH Landowski
Thomas Rüdiger
Wen-Yu Chuang
WY Chuang
Y Choi
Y Ma
Publication venue: Springer-Verlag
Publication date: 01/01/2008

Angioimmunoblastic T-cell lymphoma (AILT) represents a subset of T-cell lymphomas but resembles an autoimmune disease in many of its clinical aspects. Despite the phenotype of effector T-cells and high expression of FAS and CTLA-4 receptor molecules, tumor cells fail to undergo apoptosis. We investigated single nucleotide polymorphisms (SNPs) of the FAS and CTLA-4 genes in 94 peripheral T-cell lymphomas. Although allelic frequencies of some FAS SNPs were enriched in AILT cases, none of these occurred at a different frequency compared to healthy individuals. Therefore, SNPs in these genes are not associated with the apoptotic defect and autoimmune phenomena in AILT.

Crossref

Springer - Publisher Connector

PubMed Central

Parameterized Melody Generation with Autoencoders and Temporally-Consistent Noise

Author: Alegre Lucas N.
Castro da Silva Bruno
Tørresen Jim
Weber Aline
Publication venue: PGDesign / Universidade Federal do Rio Grande do Sul
Publication date: 01/01/2019

We introduce a machine learning technique to autonomously generate novel melodies that are variations of an arbitrary base melody. These are produced by a neural network that ensures that (with high probability) the melodic and rhythmic structure of the new melody is consistent with a given set of sample songs. We train a Variational Autoencoder network to identify a low-dimensional set of variables that allows for the compression and representation of sample songs. By perturbing these variables with Perlin Noise, a temporally-consistent parameterized noise function, it is possible to generate smoothly-changing novel melodies. We show that (1) by regulating the amount of noise, one can specify how much of the base song will be preserved; and (2) there is a direct correlation between the noise signal and the differences between the statistical properties of novel melodies and the original one. Users can interpret the controllable noise as a type of “creativity knob”: the higher it is, the more leeway the network has to generate significantly different melodies. We present a physical prototype that allows musicians to use a keyboard to provide base melodies and to adjust the network’s “creativity knobs” to regulate in real time the process that proposes new melody ideas.
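The idea of perturbing latent variables with temporally-consistent noise can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: it uses a simple one-dimensional gradient noise in the style of Perlin noise, the names `make_gradient_noise_1d` and `perturb_latent` are hypothetical, and the autoencoder that would decode the perturbed vector into a melody is omitted.

```python
import math
import random
from typing import Callable, List, Sequence

def make_gradient_noise_1d(seed: int = 0, n_gradients: int = 256) -> Callable[[float], float]:
    """Build a smooth 1-D gradient-noise function (Perlin-style sketch)."""
    rng = random.Random(seed)
    # One random slope per integer lattice point, reused periodically.
    gradients = [rng.uniform(-1.0, 1.0) for _ in range(n_gradients)]

    def noise(t: float) -> float:
        i0 = math.floor(t)
        frac = t - i0
        g0 = gradients[i0 % n_gradients]
        g1 = gradients[(i0 + 1) % n_gradients]
        # Quintic fade curve 6f^5 - 15f^4 + 10f^3 for smooth interpolation.
        fade = frac * frac * frac * (frac * (frac * 6.0 - 15.0) + 10.0)
        left = g0 * frac            # contribution of the left lattice point
        right = g1 * (frac - 1.0)   # contribution of the right lattice point
        return left + (right - left) * fade

    return noise

def perturb_latent(
    z: Sequence[float],
    t: float,
    amplitude: float,
    noises: Sequence[Callable[[float], float]],
) -> List[float]:
    """Add amplitude-scaled smooth noise to each latent coordinate at time t."""
    return [zi + amplitude * n(t) for zi, n in zip(z, noises)]


# One independent noise function per (hypothetical) latent dimension.
example_noises = [make_gradient_noise_1d(seed=s) for s in range(3)]
base_latent = [0.2, -0.5, 1.0]
```

Here `amplitude` plays the role of the "creativity knob" described in the abstract: at zero the base latent vector is returned unchanged, and larger values let the perturbed vector drift further from it while still changing smoothly as `t` advances.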

NORA - Norwegian Open Research Archives

Poster: Rosario "Ciudad Candia"

Author: Acosta S
Alegre Juan
Altuzarra César
Beltzer S
Díaz Nora
Fracaroli G
Garcia J.P
Lerro A
Mendez N
Milatich I
Sabre Lucas
Sansarricq Karina
Sosa Guillermo
Publication venue: Universidad Nacional de Rosario. Facultad de Arquitectura, Planeamiento y Diseño
Publication date: 01/06/2018

The general objective is to give visibility to the company's output, which will make it possible to trace a cross-section of local architectural development, weaving together historical periods, designers, and construction techniques. It is notable how the historiography of architecture tends to cite the designer, relegating to the background the builders who contributed their empirical and practical knowledge to the construction of the city. Affiliation: Secretaria de Ciencia y Tecnología - Universidad Nacional de Rosario. Facultad de Arquitectura, Planeamiento y Diseño; Argentina

Repositorio Hipermedial de la Universidad Nacional de Rosario

CARMA1 is a critical lipid raft-associated regulator of TCR-induced NF-kappa B activation.

Author: A Devin
A Khoshnan
AC Chan
AG Uren
AS Fanning
B Su
BJ Mayer
C Bordier
C Montixi
C Wang
CP Ponting
D Cantrell
D Penna
DF Legler
ED Hsi
J Bertin
J Dierlamm
J Jain
J Ruland
JA Morgan
JR Bradley
K Bi
L Wang
LM McAllister-Lucas
LP Kane
M Thome
MA Doucey
MD Resh
ML Alegre
N Ghaffari-Tabrizi
O Gaide
O Leupin
P Schneider
PC Lucas
PS Kabouridis
PW Janes
Q Zhang
R Xavier
S Gerondakis
S Ilangumaran
SD Dimitratos
T Akagi
T Harder
T Yoneda
TG Willis
W Zhang
Z Sun
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2002

CARMA1 is a lymphocyte-specific member of the membrane-associated guanylate kinase (MAGUK) family of scaffolding proteins, which coordinate signaling pathways emanating from the plasma membrane. CARMA1 interacts with Bcl10 via its caspase-recruitment domain (CARD). Here we investigated the role of CARMA1 in T cell activation and found that T cell receptor (TCR) stimulation induced a physical association of CARMA1 with the TCR and Bcl10. We found that CARMA1 was constitutively associated with lipid rafts, whereas cytoplasmic Bcl10 translocated into lipid rafts upon TCR engagement. A CARMA1 mutant, defective for Bcl10 binding, had a dominant-negative (DN) effect on TCR-induced NF-kappa B activation and IL-2 production and on the c-Jun NH(2)-terminal kinase (Jnk) pathway when the TCR was coengaged with CD28. Together, our data show that CARMA1 is a critical lipid raft-associated regulator of TCR-induced NF-kappa B activation and CD28 costimulation-dependent Jnk activation.

KOPS - The Institutional Repository of the University of Konstanz

Crossref

Serveur académique lausannois